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METHOD AND APPARATUS TO CONSTRUCT A FIFTY PERCENT (50%) 
DUTY CYCLE CLOCK SIGNAL ACROSS POWER DOMAINS 

BACKGROUND 

1. Field 

[0001] Embodiments of the present invention relate to computers and, in 
particular, to microprocessor clock generators. 

2. Discussion of Related Art 

[0002] Microprocessors use many different clocks to synchronize the 
operations of the various circuits inside the microprocessor. For example, one 
clock may determine the speed at which the microprocessor core runs 
(microprocessor core clock). Some microprocessors are designed such that the 
microprocessor core clock has a duty cycle of approximately fifty percent. A 
clock generator is normally used to generate the microprocessor core clock. 

[0003] A limitation of clock signals is that their duty cycles are frequency 
dependent. The situation worsens when a clock signal propagates across power 
domains. For example, when the clock signal has to cross power domains, the 
clock signal pulse shape will change. The rising edges and falling edges of the 
clock signal will become asymmetrical (e.g., the duty cycle is no longer fifty 
percent). 
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[0004] One common way of ameliorating this situation is to have the power 
supply to the clock generator be the same as the power supply to the 
microprocessor core. For example, the power supply to the microprocessor and 
the clock generator can be maintained at 1.3 volts. 

[0005] This solution may be feasible to maintain duty cycle performance on a 
very small scale, but has its drawbacks. For example, this solution is not very 
versatile. In general, the more products the clock generator/ microprocessor 
combination is compatible with the better. Sometimes a server application may 
run better when the clock generator operates at 1.3 volts Vcc (or Vdd) and the 
microprocessor operates at 1.7 volts Vcc. A desktop application may run better 
when the clock generator operates at 1.5 volts Vcc and the microprocessor 
operates at 1.7 volts Vcc. A mobile application may run better when the clock 
generator operates at 1.3 volts Vcc and the microprocessor operates at 1.0 volt 
Vcc Thus, although adequate, this approach has its limitations. 

[0006] Another traditional way of making this situation better is to allow the 

clock to cross power domains but then in the destination domain to apply a 

divide-by-two function to the rising edges of the clock generator output to 

obtain the microprocessor core clock signal. Because the divide-by-two 

approach uses only the rising edges of clock generator output to determine 
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both rising and falling edges of the microprocessor core clock signal the clock 
generator output duty cycle is irrelevant to the microprocessor core clock signal 
duty cycle. As long as the clock generator output period is stable, the 
microprocessor core clock signal duty cycle will be fifty percent. Because of this 
same reason, the clock generator output can be distributed across power 
domains while maintaining microprocessor core clock signal symmetry. 

[0007] A drawback to this approach is that as technology advances its 
implementation becomes quite expensive. In general, the less expensive 
individual components are for a computer, the less expensive the computer. 
For a clock generator to work with state of the art microprocessors that operate 
at three gigahertz (GHz), for example, the clock generator must output a six 
GHz clock signal. As the frequency of the clock generator increases, its 
complexity increases. As the complexity of the clock generator circuit increases, 
it becomes a less attractive technique to construct a microprocessor core clock 
signal that has a duty cycle of approximately fifty percent across power 
domains. This is because the area that the clock generator consumes 
increases. The clock generator also consumes more power. 
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DETAILED DESCRIPTION OF THE ILLUSTRATED EMBODIMENTS 



[0008] In the drawings, like reference numbers generally indicate identical, 
functionally similar, and/ or structurally equivalent elements. The drawing in 
which an element first appears is indicated by the leftmost digit(s) in the 
reference number, in which: 

[0009] Figure 1 is a schematic diagram of a computer system according to an 
embodiment of the present invention; 

[0010] Figure 2 is a graphical representation of timing among circuits in the 
computer system in Figure 1 according to an embodiment of the present 
invention; and 

[0011] Figure 3 is a schematic diagram of an exclusive OR gate according to 
an embodiment of the present invention. 

DETAILED DESCRIPTION OF THE ILLUSTRATED EMBODIMENTS 

[0012] Figure 1 is a high-level block diagram of a computer system 100 

according to an embodiment of the present invention. The system 100 includes 

a clock generator 102 coupled to a clock buffer 104. The clock buffer 104 is 

coupled to parallel divide-by- two circuitry 106. The parallel divider circuitry 

106 is coupled to an exclusive OR (XOR) gate 108. 
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In the embodiment of the present invention illustrated in Figure 1, the clock 
generator 102, clock buffer 104, parallel divide-by-two circuitry 106, and XOR 
gate 108 are part of a microprocessor 110. In alternative embodiments of the 
present invention, the clock generator 102, clock buffer 104, parallel divide-by- 
two circuitry 106, and/ or the XOR gate 108 may be external to the 
microprocessor 110. The microprocessor 110 is coupled to a memory 111. 

[0013] The example clock generator 102 provides two output clocks 112 and 
114. An oscillator 103 inside the clock generator 102 outputs 112 and 114 
have a frequency /. The oscillator outputs 112 and 114 are complementary 
such that when the oscillator output 112 is high the oscillator output 114 is 
low and when the oscillator output 1 12 is low the oscillator output 1 14 is high. 
The oscillator outputs 112 and 114 have approximately a fifty percent duty 
cycle, however, even though they are one hundred eighty degrees out of phase 
with each other. 

[0014] In known microprocessor core clock generating circuits, both the 

clock signal and its complement are not used. Only one is used because one is 

adequate to accommodate relatively low operating frequencies. In embodiments 

of the present invention, both oscillator outputs 112 and 114 are coupled to 

and used by the clock buffer 104. As will be described below, a parallel divide 
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function is applied to generate four signals (i.e., //2, its complement, f/2 at 
ninety degrees out of phase from// 2 and its complement). The four signals are 
then applied to an XOR gate that recombines them to generate a clock signal 
that has a duty cycle of approximately fifty percent and the frequency /, which 
is not half {f/2) the frequency /of the oscillator outputs 112 and 114 but the 
same frequency /as the oscillator outputs 112 and 114. 

[0015] Embodiments of the present invention thus provide a clock generator 
102 /microprocessor 110 combination that may be compatible with more 
applications such a servers, desktops, and mobile computers. Other 
embodiments of the invention may be less expensive to implement because the 
frequency / of the microprocessor core clock is the same frequency / as the 
oscillator outputs 112 and 114. 

[0016] The example clock buffer 104 receives the oscillator outputs 112 and 
114 and buffers them to provide two clock signals 118 and 120, respectively. 
The dashed line 117 indicates that the clock generator 102 is in the power 
domain 116 and the clock buffer 104 is in the power domain 121. As a result, 
the duty cycles of the clock signals 118 and 120 are no longer fifty percent. 

[0017] Figure 2 is a graphical representation 200 of the timing in the system 

100 according to an embodiment of the present invention. A timing diagram 
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202 represents the clock signal 118 and a timing diagram 204 represents the 
clock signal 120. Note that the clock signals 118 and 120 are complementary 
such that when the clock signal 118 is high the clock signal 120 is low and 
when the clock signal 1 18 is low the clock signal 120 is high. The clock signals 
118 and 120 have the frequency /. Also note that the falling edges of clock 
signals 118 and 120 are slower than the rising edges of the clock signals. This 
is caused by the voltage of the power domain 121 being less than the voltage of 
power domain 116. Thus, while the oscillator outputs 112 and 114 have a fifty 
percent duty cycle, the clock signals 118 and 120 have duty cycle well above 
fifty percent. 

[0018] The clock signals 118 and 120 are input to the parallel divider 
circuitry 106. The parallel divider circuitry 106 includes four divide-by- two 
circuits 122, 124, 126, and 128. 

[0019] The example divide-by- two circuit 122, which is preset to logic "zero," 
(e.g., as indicated by "RESET") receives the clock signal 118 and outputs a 
signal 132 that is one-half the frequency of the clock signal 118 (or //2). A 
timing diagram 206 represents the signal 132. Note that the divide-by- two 
circuit 122 switches on the rising edge of the clock signal 1 18. 



[0020] The example divide-by- two circuit 126, which is also preset to logic 
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"zero," (e.g., as indicated by "RESET") receives the clock signal 120 and outputs 
a signal 134 that is one-half the frequency of the clock signal 120 (or //2). A 
timing diagram 208 represents the signal 134. Note that the divide-by- two 
circuit 126 switches on the rising edge of the clock signal 120 and the signal 
134 is ninety degrees out of phase with the signal 132. 

[0021] The example divide-by-two circuit 124, which is preset to logic "one," 
(e.g., as indicated by "SET") receives the clock signal 118 and outputs a signal 
136 that is one-half the frequency of the clock signal 118 (or f/2). A timing 
diagram 210 represents the signal 136. Note that the divide-by- two circuit 124 
switches on the rising edge of the clock signal 118 and the signal 136 is the 
complement to the signal 132. 

[0022] The example divide-by- two circuit 128, which also is preset to logic 
"one," (e.g., as indicated by "SET") receives the clock signal 120 and outputs a 
signal 138 that is one-half the frequency of the clock signal 120 (or f/2), A 
timing diagram 212 represents the signal 138. Note that the divide-by-two 
circuit 128 switches on the rising edge of the clock signal 120 and the signal 
138 is the complement to the signal 134. 

[0023] Of course, embodiments of the present invention may be implemented 

other than as described above. For example, embodiments may be 
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implemented by having the divide-by- two circuit 122 preset to logic "one," the 
divide-by-two circuit 124 preset to logic "zero," the divide-by-two circuit 126 
preset to logic "one," and the divide-by- two circuit 128 preset to logic "zero." 
After reading the description herein, a person of ordinary skill in the relevant 
art will readily recognize how to implement such embodiments of the present 
invention. 

[0024] The XOR gate 108 includes three NAND gates 140, 142, and 144. The 
NAND gate 140 receives the signals 132 and 138. When the signals 132 and 
138 change to become the same state the NAND gate 140 changes states from 
logic "one" to logic "zero" or vice versa. The output of the NAND gate 142 is a 
signal 148. 

[0025] The NAND gate 142 receives the signals 134 and 136. When the 
signals 134 and 136 change to become the same state the NAND gate 142 
changes states from logic "one" to logic "zero" or vice versa. The output of the 
NAND gate 142 is a signal 150. 

[0026] The NAND gate 144 receives the signals 148 and 150. When the 

signals 148 and 150 change to become the same state the NAND gate 144 

changes states from logic "one" to logic "zero" or vice versa. The output of the 

NAND gate 144 is the clock signal 160. Because the signals 132/136 and 
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134/138 are two sets of complementary signals that are ninety degrees out of 
phase with each other, the XOR gate 108 may be used to recombine the signals 
132/136 and 134/ 138 to arrive at the clock signal 160. 

[0027] The timing diagram 214 shows the clock signal 160. Note that the 
clock signal 160 has a duty cycle of approximately fifty percent and has a 
frequency / that is the same as the frequency /of the oscillator output 112. In 
one embodiment, the duty cycle of the resulting clock signal ranges from 
approximately 0.475 (47.5%) and 0.525 (52.5%). In an alternative embodiment 
of the present invention, the duty cycle of the resulting clock signal ranges 
from approximately 0.493 (49.3%) to 0.514 (51.4%). Other embodiments may 
have narrower duty cycles. 

[0028] The clock generator 102 may be any circuit that generates a signal 
suitable for use as a clock signal. For example, the clock generator 102 may be 
a phase-locked loop clock generator having a voltage-controlled oscillator 
(VCO). Suitable clock generators are known. 

[0029] The clock buffer 104 may be any circuit that is capable of buffering 
the output of the clock generator 102. Suitable buffers are known. 



[0030] Microprocessors suitable for implementing the microprocessor 110 
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are known. A suitable microprocessor includes an Intel® Pentium® 4 
microprocessor available from Intel® Corporation in Santa Clara, California. 



[0031] The memory 111 may be any known dynamic random access memory 
(DRAM), static RAM (SRAM), Flash memoiy, etc. 

[0032] In the embodiment shown in Figure 2, the microprocessor 110, and 
the clock buffer 104 are shown in the power domain 121 and the clock 
generator 102 is shown in the power domain 116. In an alternative 
embodiment, the memory 111 is in the power domain 170 and the 
microprocessor 110 is in the power domain 172, as indicated by the dashed 
line 174. The power domains 116, 121, 170, and 172 may be analog power 
domains and/or digital power domains. In either case, the clock signal 160 
constructed from the oscillator outputs 112 and 114 has a duty cycle of 
approximately fifty percent and a frequency / that is the same frequency / as 
the oscillator outputs 112 and 114. 

[0033] The timing from one input to one output of either of the NAND gates 

140, 142, and/ or 144 may not be the same. This is because their inputs, 

whose rise times and fall times may be different, come from four different 

paths. As a result, the signals 148 and 150 may not be symmetrical. When the 

signals 148 and 150 are not symmetrical the signal 160 may not be stable. 
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[0034] Figure 3 is a schematic diagram showing the XOR gate 108 in more 
detail according to an embodiment of the present invention in which the XOR 
gate 108 is balanced to provide symmetrical outputs for the NAND gates 140 
and 142. The symmetrical signals 148 and 150 provide a more stable signal 
160. 

[0035] The XOR gate 108 includes eight inverters 302, 304, 306, 308, 310, 
312, 314, and 316, and six NAND gates 320, 322, 324, 326, 328, and 330. Vcc 
from the power domain 121 is coupled to each inverter 302, 304, 306, 308, 
310, 312, 314, and 316, and each NAND gates 320, 322, 324, 326, 328, and 
330. 

[0036] The inverter 302 is coupled to drive the signal 132 to the inverter 304, 
which drives one input of the NAND gate 320 and one input of the NAND gate 
322. The inverter 306 is coupled to drive the signal 138 to the inverter 308, 
which drives the other input of the NAND gate 322 and the other input of the 
NAND gate 320. 

[0037] The outputs of the NAND gates 320 and 322, which are now 

symmetrical, are applied to one input of the NAND gate 328 and one input of 

the NAND gate 330. The outputs of the NAND gates 324 and 326, which also 
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are now symmetrical, are applied to the other input of the NAND gate 328 and 
the other input of the NAND gate 330. The outputs of the NAND gates 328 and 
330, which are now symmetrical, are combined to generate the signal 160. 

[0038] Embodiments of the present invention may be implemented using 
hardware, software, or a combination thereof. In implementations using 
software, the software may be stored on a machine-accessible medium. 

[0039] A machine-accessible medium includes any mechanism that provides 
(i.e., stores and/ or transmits) information in a form accessible by a machine 
(e.g., a computer, network device, personal digital assistant, manufacturing 
tool, any device with a set of one or more processors, etc.). For example, a 
machine-accessible medium includes recordable and non-recordable media 
(e.g., read only memory (ROM), random access memory (RAM), magnetic disk 
storage media, optical storage media, flash memory devices, etc.), as well as 
electrical, optical, acoustic, or other form of propagated signals (e.g., carrier 
waves, infrared signals, digital signals, etc.). 

[0040] Of course, embodiments of the present invention are not limited to 
clocks for microprocessors. For example, embodiments of the present invention 
may be used in other systems (e.g., input/output (I/O) systems) to maintain a 
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duty cycle of approximately fifty percent cycle across power domains. 



[0041] The above description of illustrated embodiments of the invention is 
not intended to be exhaustive or to limit embodiments of the invention to the 
precise forms disclosed. While specific embodiments of, and examples for, the 
invention are described herein for illustrative purposes, various equivalent 
modifications are possible, as those skilled in the relevant art will recognize. 
These modifications can be made in light of the above detailed description. 

[0042] In the above description, numerous specific details, such as 
particular processes, materials, devices, and so forth, are presented to provide 
a thorough understanding of embodiments of the invention. One skilled in the 
relevant art will recognize, however, that the embodiments of the present 
invention can be practiced without one or more of the specific details, or with 
other methods, components, etc. In other instances, well-known structures or 
operations are not shown or described in detail to avoid obscuring the 
understanding of this description. 

[0043] Various operations have been described as multiple discrete 

operations performed in turn in a manner that is most helpful in 

understanding embodiments of the invention. However, the order in which they 

are described should not be construed to imply that these operations are 
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necessarily order dependent or that the operations be performed in the order in 
which the operations are presented. 



[0044] Reference throughout this specification to "one embodiment" or "an 
embodiment" means that a particular feature, structure, process, block, or 
characteristic described in connection with the embodiment is included in at 
least one embodiment of the present invention. Thus, the appearances of the 
phrases "in one embodiment" or "in an embodiment" in various places 
throughout this specification are not necessarily all referring to the same 
embodiment. Furthermore, the particular features, structures, or 
characteristics may be combined in any suitable manner in one or more 
embodiments. 
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